Dataset statistics
| Number of variables | 25 |
|---|---|
| Number of observations | 45811 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 8.7 MiB |
| Average record size in memory | 200.0 B |
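The memory figures above can be cross-checked with simple arithmetic. The sketch below assumes that every cell is counted as 8 bytes (a 64-bit number, or a 64-bit object pointer under shallow memory accounting). That per-cell size is an assumption and is not stated in the report.

```python
# Cross-check of the reported memory figures.
# Assumption (not stated in the report): each cell is counted as 8 bytes.
N_ROWS = 45811       # number of observations
N_COLS = 25          # number of variables
BYTES_PER_CELL = 8   # assumed

record_bytes = N_COLS * BYTES_PER_CELL
total_mib = N_ROWS * record_bytes / 2**20

print(f"record size: {record_bytes} B")
print(f"total size:  {total_mib:.1f} MiB")
```

Under this assumption the result agrees with the reported 200.0 B per record and 8.7 MiB in total.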
Variable types
| Numeric | 13 |
|---|---|
| Categorical | 12 |
ASSESSMENT_NBHD has a high cardinality: 55 distinct values | High cardinality |
Unnamed: 0 is highly correlated with ZIPCODE and 1 other fields | High correlation |
BATHRM is highly correlated with PRICE | High correlation |
PRICE is highly correlated with BATHRM and 1 other fields | High correlation |
FIREPLACES is highly correlated with PRICE | High correlation |
ZIPCODE is highly correlated with Unnamed: 0 | High correlation |
LATITUDE is highly correlated with LONGITUDE | High correlation |
LONGITUDE is highly correlated with Unnamed: 0 and 1 other fields | High correlation |
Unnamed: 0 is highly correlated with PRICE and 2 other fields | High correlation |
PRICE is highly correlated with Unnamed: 0 and 2 other fields | High correlation |
LONGITUDE is highly correlated with Unnamed: 0 and 2 other fields | High correlation |
Unnamed: 0 is highly correlated with ZIPCODE | High correlation |
AC is highly correlated with EYB and 2 other fields | High correlation |
GRADE is highly correlated with QUADRANT and 6 other fields | High correlation |
QUADRANT is highly correlated with GRADE and 6 other fields | High correlation |
YR_RMDL is highly correlated with EYB and 1 other fields | High correlation |
ASSESSMENT_NBHD is highly correlated with GRADE and 11 other fields | High correlation |
ZIPCODE is highly correlated with QUADRANT and 7 other fields | High correlation |
EYB is highly correlated with AC and 4 other fields | High correlation |
EXTWALL is highly correlated with ASSESSMENT_NBHD | High correlation |
HEAT is highly correlated with AC | High correlation |
LANDAREA is highly correlated with PRICE | High correlation |
LONGITUDE is highly correlated with GRADE and 6 other fields | High correlation |
PRICE is highly correlated with BATHRM and 2 other fields | High correlation |
WARD is highly correlated with GRADE and 8 other fields | High correlation |
KITCHENS is highly correlated with STRUCT | High correlation |
ROOF is highly correlated with ASSESSMENT_NBHD and 4 other fields | High correlation |
STRUCT is highly correlated with ASSESSMENT_NBHD and 4 other fields | High correlation |
STYLE is highly correlated with ASSESSMENT_NBHD | High correlation |
Unnamed: 0 is highly correlated with GRADE and 7 other fields | High correlation |
LATITUDE is highly correlated with QUADRANT and 5 other fields | High correlation |
CNDTN is highly correlated with AC and 2 other fields | High correlation |
AC is highly correlated with HEAT | High correlation |
WARD is highly correlated with QUADRANT and 1 other fields | High correlation |
QUADRANT is highly correlated with WARD and 1 other fields | High correlation |
ASSESSMENT_NBHD is highly correlated with WARD and 1 other fields | High correlation |
Unnamed: 0 has unique values | Unique |
HF_BATHRM has 17530 (38.3%) zeros | Zeros |
YR_RMDL has 18129 (39.6%) zeros | Zeros |
FIREPLACES has 24362 (53.2%) zeros | Zeros |
Reproduction
| Analysis started | 2021-07-10 09:23:08.351452 |
|---|---|
| Analysis finished | 2021-07-10 09:23:48.049989 |
| Duration | 39.7 seconds |
| Software version | pandas-profiling v3.0.0 |
| Download configuration | config.json |
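A report of this kind can be regenerated with a few lines of Python. The following is a minimal sketch, assuming the data is available as a CSV file; the function name, the file paths, and the report title are hypothetical and do not come from the report itself.

```python
def build_report(csv_path, html_path):
    """Profile a CSV file and write the result as an HTML report."""
    # Imported lazily so the function can be defined even when the
    # libraries are not installed.
    import pandas as pd
    from pandas_profiling import ProfileReport

    df = pd.read_csv(csv_path)  # csv_path: hypothetical input file
    profile = ProfileReport(df, title="Dataset profile")  # title is illustrative
    profile.to_file(html_path)
    return profile
```

A call such as `build_report("data.csv", "report.html")` would write the report next to the input file. A column named `Unnamed: 0` usually appears when a CSV that was saved together with its index is read back without `index_col=0`; whether that happened here is not stated in the report.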
Unnamed: 0
Real number (ℝ≥0)
HIGH CORRELATION, UNIQUE
| Distinct | 45811 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 49924.12436 |
| Minimum | 0 |
|---|---|
| Maximum | 106687 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 358.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 4760.5 |
| Q1 | 22999.5 |
| median | 48103 |
| Q3 | 76620.5 |
| 95-th percentile | 100330 |
| Maximum | 106687 |
| Range | 106687 |
| Interquartile range (IQR) | 53621 |
Descriptive statistics
| Standard deviation | 30672.10125 |
|---|---|
| Coefficient of variation (CV) | 0.6143743459 |
| Kurtosis | -1.195823393 |
| Mean | 49924.12436 |
| Median Absolute Deviation (MAD) | 26650 |
| Skewness | 0.1388722242 |
| Sum | 2287074061 |
| Variance | 940777795.1 |
| Monotonicity | Strictly increasing |
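Several of the descriptive statistics are derived from others in the same tables. The sketch below recomputes them from the reported values for `Unnamed: 0`; the variable names are illustrative.

```python
# Values copied from the statistics tables for `Unnamed: 0`.
mean = 49924.12436
std = 30672.10125
q1, q3 = 22999.5, 76620.5
minimum, maximum = 0, 106687

cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance is the squared standard deviation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range

print(cv, variance, iqr, value_range)
```

The recomputed values agree with the reported CV, variance, IQR, and range up to rounding.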
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 94168 | 1 | < 0.1% |
| 4823 | 1 | < 0.1% |
| 25305 | 1 | < 0.1% |
| 96986 | 1 | < 0.1% |
| 19164 | 1 | < 0.1% |
| 17117 | 1 | < 0.1% |
| 88798 | 1 | < 0.1% |
| 41697 | 1 | < 0.1% |
| 47842 | 1 | < 0.1% |
| Other values (45801) | 45801 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 5 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 14 | 1 | |
| 16 | 1 | |
| 19 | 1 | |
| 22 | 1 |
| Value | Count | Frequency (%) |
| 106687 | 1 | |
| 106673 | 1 | |
| 106672 | 1 | |
| 106668 | 1 | |
| 106666 | 1 | |
| 106664 | 1 | |
| 106663 | 1 | |
| 106662 | 1 | |
| 106657 | 1 | |
| 106656 | 1 |
BATHRM
Real number (ℝ≥0)
| Distinct | 13 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.246862107 |
| Minimum | 0 |
|---|---|
| Maximum | 12 |
| Zeros | 5 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 358.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 4 |
| Maximum | 12 |
| Range | 12 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.048311669 |
|---|---|
| Coefficient of variation (CV) | 0.4665669804 |
| Kurtosis | 2.006920455 |
| Mean | 2.246862107 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.9360509198 |
| Sum | 102931 |
| Variance | 1.098957355 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 17218 | |
| 1 | 11920 | |
| 3 | 11690 | |
| 4 | 3910 | 8.5% |
| 5 | 715 | 1.6% |
| 6 | 248 | 0.5% |
| 7 | 68 | 0.1% |
| 8 | 21 | < 0.1% |
| 9 | 7 | < 0.1% |
| 0 | 5 | < 0.1% |
| Other values (3) | 9 | < 0.1% |
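Some rows in the frequency tables carry a count but no percentage. A percentage can be recomputed from the count and the number of observations, as in this short sketch (the helper name `frequency_pct` is hypothetical):

```python
N_ROWS = 45811  # number of observations from the dataset statistics

# Counts copied from the common-values table above.
counts = {2: 17218, 1: 11920, 3: 11690, 4: 3910, 5: 715, 6: 248}

def frequency_pct(count, total=N_ROWS):
    """Relative frequency in percent, rounded to one decimal place."""
    return round(100 * count / total, 1)

for value, count in counts.items():
    print(value, count, f"{frequency_pct(count)}%")
```

For the rows where the report does show a percentage (values 4, 5 and 6), the recomputed figures match the printed ones.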
| Value | Count | Frequency (%) |
| 0 | 5 | < 0.1% |
| 1 | 11920 | |
| 2 | 17218 | |
| 3 | 11690 | |
| 4 | 3910 | 8.5% |
| 5 | 715 | 1.6% |
| 6 | 248 | 0.5% |
| 7 | 68 | 0.1% |
| 8 | 21 | < 0.1% |
| 9 | 7 | < 0.1% |
| Value | Count | Frequency (%) |
| 12 | 1 | < 0.1% |
| 11 | 3 | < 0.1% |
| 10 | 5 | < 0.1% |
| 9 | 7 | < 0.1% |
| 8 | 21 | < 0.1% |
| 7 | 68 | 0.1% |
| 6 | 248 | 0.5% |
| 5 | 715 | 1.6% |
| 4 | 3910 | 8.5% |
| 3 | 11690 |
HF_BATHRM
Real number (ℝ≥0)
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.6840496824 |
| Minimum | 0 |
|---|---|
| Maximum | 7 |
| Zeros | 17530 |
| Zeros (%) | 38.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 358.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 7 |
| Range | 7 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.5990047611 |
|---|---|
| Coefficient of variation (CV) | 0.8756743502 |
| Kurtosis | 0.6323083574 |
| Mean | 0.6840496824 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.4278787713 |
| Sum | 31337 |
| Variance | 0.3588067038 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 25388 | |
| 0 | 17530 | |
| 2 | 2767 | 6.0% |
| 3 | 98 | 0.2% |
| 4 | 21 | < 0.1% |
| 5 | 6 | < 0.1% |
| 7 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 17530 | |
| 1 | 25388 | |
| 2 | 2767 | 6.0% |
| 3 | 98 | 0.2% |
| 4 | 21 | < 0.1% |
| 5 | 6 | < 0.1% |
| 7 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 7 | 1 | < 0.1% |
| 5 | 6 | < 0.1% |
| 4 | 21 | < 0.1% |
| 3 | 98 | 0.2% |
| 2 | 2767 | 6.0% |
| 1 | 25388 | |
| 0 | 17530 |
HEAT
Categorical
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 358.0 KiB |
| Forced Air | |
|---|---|
| Hot Water Rad | |
| Warm Cool | |
| Ht Pump | 662 |
| Water Base Brd | 79 |
| Other values (9) | 225 |
Length
| Max length | 14 |
|---|---|
| Median length | 10 |
| Mean length | 10.70369562 |
| Min length | 7 |
Characters and Unicode
| Total characters | 490347 |
|---|---|
| Distinct characters | 36 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
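The length and character statistics of this variable are linked: the mean length is the total character count divided by the number of observations, and the per-category character counts add up to the total. A small sketch using the figures from the report's tables for this variable:

```python
N_ROWS = 45811        # number of observations
TOTAL_CHARS = 490347  # total characters

# Per-category character counts copied from the report.
category_counts = {
    "Lowercase Letter": 322480,
    "Uppercase Letter": 106839,
    "Space Separator": 61002,
    "Dash Punctuation": 26,
}

mean_length = TOTAL_CHARS / N_ROWS
categories_total = sum(category_counts.values())
shares = {name: round(100 * n / TOTAL_CHARS, 1)
          for name, n in category_counts.items()}

print(mean_length, categories_total, shares)
```

The same bookkeeping applies to the script and block tables, whose counts also sum to the total character count.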
Sample
| 1st row | Warm Cool |
|---|---|
| 2nd row | Hot Water Rad |
| 3rd row | Hot Water Rad |
| 4th row | Hot Water Rad |
| 5th row | Hot Water Rad |
Common Values
| Value | Count | Frequency (%) |
| Forced Air | 18112 | |
| Hot Water Rad | 15086 | |
| Warm Cool | 11647 | |
| Ht Pump | 662 | 1.4% |
| Water Base Brd | 79 | 0.2% |
| Wall Furnace | 57 | 0.1% |
| Elec Base Brd | 52 | 0.1% |
| Electric Rad | 28 | 0.1% |
| Gravity Furnac | 27 | 0.1% |
| Air-Oil | 26 | 0.1% |
| Other values (4) | 35 | 0.1% |
Length
| Value | Count | Frequency (%) |
| air | 18121 | |
| forced | 18112 | |
| water | 15165 | |
| rad | 15114 | |
| hot | 15086 | |
| cool | 11658 | |
| warm | 11647 | |
| ht | 662 | 0.6% |
| pump | 662 | 0.6% |
| base | 131 | 0.1% |
| Other values (14) | 455 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 63341 | |
| (space) | 61002 | |
| o | 56522 | |
| a | 42241 | 8.6% |
| e | 33545 | 6.8% |
| d | 33364 | 6.8% |
| t | 30983 | 6.3% |
| W | 26869 | 5.5% |
| c | 18313 | 3.7% |
| i | 18235 | 3.7% |
| Other values (26) | 105932 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 322480 | |
| Uppercase Letter | 106839 | 21.8% |
| Space Separator | 61002 | 12.4% |
| Dash Punctuation | 26 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 63341 | |
| o | 56522 | |
| a | 42241 | |
| e | 33545 | |
| d | 33364 | |
| t | 30983 | |
| c | 18313 | 5.7% |
| i | 18235 | 5.7% |
| m | 12309 | 3.8% |
| l | 11878 | 3.7% |
| Other values (9) | 1749 | 0.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 26869 | |
| F | 18196 | |
| A | 18147 | |
| H | 15748 | |
| R | 15114 | |
| C | 11658 | |
| P | 662 | 0.6% |
| B | 262 | 0.2% |
| E | 100 | 0.1% |
| G | 27 | < 0.1% |
| Other values (5) | 56 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 61002 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 26 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 429319 | |
| Common | 61028 | 12.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 63341 | |
| o | 56522 | |
| a | 42241 | |
| e | 33545 | 7.8% |
| d | 33364 | 7.8% |
| t | 30983 | 7.2% |
| W | 26869 | 6.3% |
| c | 18313 | 4.3% |
| i | 18235 | 4.2% |
| F | 18196 | 4.2% |
| Other values (24) | 87710 |
Common
| Value | Count | Frequency (%) |
| (space) | 61002 | |
| - | 26 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 490347 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 63341 | |
| (space) | 61002 | |
| o | 56522 | |
| a | 42241 | 8.6% |
| e | 33545 | 6.8% |
| d | 33364 | 6.8% |
| t | 30983 | 6.3% |
| W | 26869 | 5.5% |
| c | 18313 | 3.7% |
| i | 18235 | 3.7% |
| Other values (26) | 105932 |
AC
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 358.0 KiB |
| 1 | |
|---|---|
| 0 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 45811 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 35227 | |
| 0 | 10584 | 23.1% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 1 | 35227 | |
| 0 | 10584 | 23.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 35227 | |
| 0 | 10584 | 23.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 45811 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 35227 | |
| 0 | 10584 | 23.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 45811 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 35227 | |
| 0 | 10584 | 23.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 45811 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 35227 | |
| 0 | 10584 | 23.1% |
YR_RMDL
Real number (ℝ≥0)
| Distinct | 97 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1211.445286 |
| Minimum | 0 |
|---|---|
| Maximum | 2019 |
| Zeros | 18129 |
| Zeros (%) | 39.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 358.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1999 |
| Q3 | 2010 |
| 95-th percentile | 2016 |
| Maximum | 2019 |
| Range | 2019 |
| Interquartile range (IQR) | 2010 |
Descriptive statistics
| Standard deviation | 980.4313274 |
|---|---|
| Coefficient of variation (CV) | 0.8093071463 |
| Kurtosis | -1.818079808 |
| Mean | 1211.445286 |
| Median Absolute Deviation (MAD) | 17 |
| Skewness | -0.4261638437 |
| Sum | 55497520 |
| Variance | 961245.5877 |
| Monotonicity | Not monotonic |
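Because 39.6% of the values are 0, the mean of about 1211 is not a meaningful year. The sketch below assumes that 0 is a placeholder for "no year recorded"; the report does not say so, and this reading is an assumption. Since zeros add nothing to the sum, the mean of the non-zero values follows directly from the reported sum and counts.

```python
n_rows = 45811    # number of observations
n_zeros = 18129   # reported number of zeros
total = 55497520  # "Sum" from the descriptive statistics

mean_all = total / n_rows
# Assumption: 0 marks a missing year, so it is excluded here.
mean_nonzero = total / (n_rows - n_zeros)

print(f"mean over all rows:      {mean_all:.1f}")
print(f"mean over non-zero rows: {mean_nonzero:.1f}")
```

The non-zero mean falls in the early 2000s, which is in line with the reported median of 1999 and the most frequent values.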
| Value | Count | Frequency (%) |
| 0 | 18129 | |
| 2011 | 1656 | 3.6% |
| 2013 | 1645 | 3.6% |
| 2014 | 1576 | 3.4% |
| 2012 | 1549 | 3.4% |
| 2010 | 1545 | 3.4% |
| 2015 | 1480 | 3.2% |
| 2004 | 1446 | 3.2% |
| 2005 | 1389 | 3.0% |
| 2006 | 1314 | 2.9% |
| Other values (87) | 14082 |
| Value | Count | Frequency (%) |
| 0 | 18129 | |
| 1880 | 1 | < 0.1% |
| 1900 | 1 | < 0.1% |
| 1910 | 1 | < 0.1% |
| 1920 | 1 | < 0.1% |
| 1923 | 1 | < 0.1% |
| 1925 | 3 | < 0.1% |
| 1926 | 1 | < 0.1% |
| 1927 | 1 | < 0.1% |
| 1928 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 2019 | 1 | < 0.1% |
| 2018 | 309 | 0.7% |
| 2017 | 1185 | |
| 2016 | 1277 | |
| 2015 | 1480 | |
| 2014 | 1576 | |
| 2013 | 1645 | |
| 2012 | 1549 | |
| 2011 | 1656 | |
| 2010 | 1545 |
EYB
Real number (ℝ≥0)
| Distinct | 80 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1971.091441 |
| Minimum | 1928 |
|---|---|
| Maximum | 2018 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 358.0 KiB |
Quantile statistics
| Minimum | 1928 |
|---|---|
| 5-th percentile | 1950 |
| Q1 | 1960 |
| median | 1967 |
| Q3 | 1978 |
| 95-th percentile | 2011 |
| Maximum | 2018 |
| Range | 90 |
| Interquartile range (IQR) | 18 |
Descriptive statistics
| Standard deviation | 16.9704238 |
|---|---|
| Coefficient of variation (CV) | 0.008609658307 |
| Kurtosis | 0.8640949167 |
| Mean | 1971.091441 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 1.135924983 |
| Sum | 90297670 |
| Variance | 287.9952839 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1967 | 5859 | 12.8% |
| 1964 | 4950 | 10.8% |
| 1969 | 4361 | 9.5% |
| 1954 | 3242 | 7.1% |
| 1957 | 3160 | 6.9% |
| 1960 | 1811 | 4.0% |
| 1972 | 1776 | 3.9% |
| 1978 | 1228 | 2.7% |
| 1943 | 1010 | 2.2% |
| 1982 | 986 | 2.2% |
| Other values (70) | 17428 |
| Value | Count | Frequency (%) |
| 1928 | 1 | < 0.1% |
| 1932 | 1 | < 0.1% |
| 1936 | 3 | < 0.1% |
| 1940 | 2 | < 0.1% |
| 1943 | 1010 | |
| 1944 | 20 | < 0.1% |
| 1945 | 21 | < 0.1% |
| 1946 | 28 | 0.1% |
| 1947 | 694 | |
| 1948 | 35 | 0.1% |
| Value | Count | Frequency (%) |
| 2018 | 103 | 0.2% |
| 2017 | 440 | |
| 2016 | 233 | |
| 2015 | 575 | |
| 2014 | 330 | |
| 2013 | 318 | |
| 2012 | 251 | |
| 2011 | 433 | |
| 2010 | 450 | |
| 2009 | 97 | 0.2% |
PRICE
Real number (ℝ≥0)
| Distinct | 7173 |
|---|---|
| Distinct (%) | 15.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 628380.4609 |
| Minimum | 250 |
|---|---|
| Maximum | 22000000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 358.0 KiB |
Quantile statistics
| Minimum | 250 |
|---|---|
| 5-th percentile | 115464 |
| Q1 | 290000 |
| median | 507000 |
| Q3 | 800000 |
| 95-th percentile | 1500000 |
| Maximum | 22000000 |
| Range | 21999750 |
| Interquartile range (IQR) | 510000 |
Descriptive statistics
| Standard deviation | 571595.0969 |
|---|---|
| Coefficient of variation (CV) | 0.9096321934 |
| Kurtosis | 104.842387 |
| Mean | 628380.4609 |
| Median Absolute Deviation (MAD) | 246000 |
| Skewness | 5.98541281 |
| Sum | 2.878673729 × 10¹⁰ |
| Variance | 3.267209548 × 10¹¹ |
| Monotonicity | Not monotonic |
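A skewness near 6 and a kurtosis above 100 point to a long right tail. One common convention for flagging extreme values is Tukey's fences, which mark values more than 1.5 × IQR beyond the quartiles. The report itself does not use this rule; it is shown here only as an illustration based on the reported quantiles.

```python
# Quantiles copied from the quantile statistics above.
q1, q3 = 290000, 800000
maximum = 22000000

iqr = q3 - q1
lower_fence = q1 - 1.5 * iqr  # values below this would be flagged
upper_fence = q3 + 1.5 * iqr  # values above this would be flagged
max_is_outlier = maximum > upper_fence

print(lower_fence, upper_fence, max_is_outlier)
```

The lower fence is negative, so under this rule only the upper tail can contain flagged values.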
| Value | Count | Frequency (%) |
| 350000 | 317 | 0.7% |
| 250000 | 245 | 0.5% |
| 550000 | 234 | 0.5% |
| 450000 | 232 | 0.5% |
| 650000 | 228 | 0.5% |
| 300000 | 220 | 0.5% |
| 325000 | 219 | 0.5% |
| 600000 | 219 | 0.5% |
| 320000 | 215 | 0.5% |
| 750000 | 209 | 0.5% |
| Other values (7163) | 43473 |
| Value | Count | Frequency (%) |
| 250 | 2 | |
| 4850 | 2 | |
| 7425 | 1 | |
| 7500 | 1 | |
| 10400 | 1 | |
| 11000 | 1 | |
| 14800 | 1 | |
| 15000 | 2 | |
| 17000 | 1 | |
| 20500 | 1 |
| Value | Count | Frequency (%) |
| 22000000 | 1 | |
| 18000000 | 1 | |
| 15000000 | 1 | |
| 12250000 | 1 | |
| 11984000 | 1 | |
| 11111111 | 1 | |
| 10750000 | 1 | |
| 9000000 | 1 | |
| 8600000 | 1 | |
| 8450000 | 2 |
SALE_NUM
Real number (ℝ≥0)
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.922507695 |
| Minimum | 1 |
|---|---|
| Maximum | 15 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 358.0 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 3 |
| 95-th percentile | 5 |
| Maximum | 15 |
| Range | 14 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.498678034 |
|---|---|
| Coefficient of variation (CV) | 0.7795433217 |
| Kurtosis | 3.179325077 |
| Mean | 1.922507695 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.752193813 |
| Sum | 88072 |
| Variance | 2.246035851 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 29638 | |
| 3 | 5339 | 11.7% |
| 4 | 3956 | 8.6% |
| 2 | 3450 | 7.5% |
| 5 | 1921 | 4.2% |
| 6 | 905 | 2.0% |
| 7 | 348 | 0.8% |
| 8 | 141 | 0.3% |
| 9 | 71 | 0.2% |
| 10 | 23 | 0.1% |
| Other values (4) | 19 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 29638 | |
| 2 | 3450 | 7.5% |
| 3 | 5339 | 11.7% |
| 4 | 3956 | 8.6% |
| 5 | 1921 | 4.2% |
| 6 | 905 | 2.0% |
| 7 | 348 | 0.8% |
| 8 | 141 | 0.3% |
| 9 | 71 | 0.2% |
| 10 | 23 | 0.1% |
| Value | Count | Frequency (%) |
| 15 | 2 | < 0.1% |
| 13 | 1 | < 0.1% |
| 12 | 6 | < 0.1% |
| 11 | 10 | < 0.1% |
| 10 | 23 | 0.1% |
| 9 | 71 | 0.2% |
| 8 | 141 | 0.3% |
| 7 | 348 | 0.8% |
| 6 | 905 | |
| 5 | 1921 |
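The median absolute deviation (MAD) of 0 reported for SALE_NUM follows from the fact that more than half of the observations (29638 of 45811) equal the median. The sketch below rebuilds the column from the value counts in the frequency tables and recomputes the median and MAD with the standard library.

```python
from statistics import median

# Value counts copied from the SALE_NUM frequency tables (value -> count).
value_counts = {1: 29638, 2: 3450, 3: 5339, 4: 3956, 5: 1921, 6: 905, 7: 348,
                8: 141, 9: 71, 10: 23, 11: 10, 12: 6, 13: 1, 15: 2}

# Expand the counts back into a flat list of observations.
values = [v for v, c in value_counts.items() for _ in range(c)]

med = median(values)
# MAD: median of the absolute deviations from the median.
mad = median(abs(v - med) for v in values)

print(len(values), sum(values), med, mad)
```

A MAD of 0 therefore signals a heavily concentrated distribution rather than an absence of spread; the standard deviation of about 1.5 still reflects the tail.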
STYLE
Categorical
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 358.0 KiB |
| 2 Story | |
|---|---|
| 3 Story | |
| 2.5 Story Fin | 3310 |
| 1 Story | 1388 |
| 1.5 Story Fin | 842 |
| Other values (12) | 854 |
Length
| Max length | 15 |
|---|---|
| Median length | 7 |
| Mean length | 7.63421449 |
| Min length | 6 |
Characters and Unicode
| Total characters | 349731 |
|---|---|
| Distinct characters | 29 |
| Distinct categories | 6 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 3 Story |
|---|---|
| 2nd row | 3 Story |
| 3rd row | 3 Story |
| 4th row | 4 Story |
| 5th row | 3 Story |
Common Values
| Value | Count | Frequency (%) |
| 2 Story | 34840 | |
| 3 Story | 4577 | 10.0% |
| 2.5 Story Fin | 3310 | 7.2% |
| 1 Story | 1388 | 3.0% |
| 1.5 Story Fin | 842 | 1.8% |
| 2.5 Story Unfin | 303 | 0.7% |
| 4 Story | 187 | 0.4% |
| Split Level | 132 | 0.3% |
| Split Foyer | 88 | 0.2% |
| 3.5 Story Fin | 71 | 0.2% |
| Other values (7) | 73 | 0.2% |
Length
| Value | Count | Frequency (%) |
| story | 45569 | |
| 2 | 34840 | |
| 3 | 4577 | 4.8% |
| fin | 4224 | 4.4% |
| 2.5 | 3613 | 3.8% |
| 1 | 1388 | 1.4% |
| 1.5 | 885 | 0.9% |
| unfin | 353 | 0.4% |
| split | 220 | 0.2% |
| 4 | 187 | 0.2% |
| Other values (7) | 321 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 50366 | |
| t | 45803 | |
| S | 45789 | |
| o | 45657 | |
| r | 45657 | |
| y | 45657 | |
| 2 | 38453 | |
| n | 4932 | 1.4% |
| i | 4805 | 1.4% |
| 3 | 4653 | 1.3% |
| Other values (19) | 17959 | 5.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 194018 | |
| Uppercase Letter | 50616 | 14.5% |
| Space Separator | 50366 | 14.4% |
| Decimal Number | 50146 | 14.3% |
| Other Punctuation | 4577 | 1.3% |
| Dash Punctuation | 8 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 45803 | |
| o | 45657 | |
| r | 45657 | |
| y | 45657 | |
| n | 4932 | 2.5% |
| i | 4805 | 2.5% |
| e | 380 | 0.2% |
| l | 372 | 0.2% |
| f | 365 | 0.2% |
| p | 220 | 0.1% |
| Other values (4) | 170 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 45789 | |
| F | 4312 | 8.5% |
| U | 353 | 0.7% |
| L | 140 | 0.3% |
| D | 12 | < 0.1% |
| B | 8 | < 0.1% |
| V | 2 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 38453 | |
| 3 | 4653 | 9.3% |
| 5 | 4577 | 9.1% |
| 1 | 2273 | 4.5% |
| 4 | 190 | 0.4% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 50366 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4577 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 8 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 244634 | |
| Common | 105097 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 45803 | |
| S | 45789 | |
| o | 45657 | |
| r | 45657 | |
| y | 45657 | |
| n | 4932 | 2.0% |
| i | 4805 | 2.0% |
| F | 4312 | 1.8% |
| e | 380 | 0.2% |
| l | 372 | 0.2% |
| Other values (11) | 1270 | 0.5% |
Common
| Value | Count | Frequency (%) |
| (space) | 50366 | |
| 2 | 38453 | |
| 3 | 4653 | 4.4% |
| . | 4577 | 4.4% |
| 5 | 4577 | 4.4% |
| 1 | 2273 | 2.2% |
| 4 | 190 | 0.2% |
| - | 8 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 349731 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 50366 | |
| t | 45803 | |
| S | 45789 | |
| o | 45657 | |
| r | 45657 | |
| y | 45657 | |
| 2 | 38453 | |
| n | 4932 | 1.4% |
| i | 4805 | 1.4% |
| 3 | 4653 | 1.3% |
| Other values (19) | 17959 | 5.1% |
STRUCT
Categorical
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 358.0 KiB |
| Row Inside | |
|---|---|
| Single | |
| Semi-Detached | |
| Row End | |
| Multi | 1732 |
| Other values (3) | 220 |
Length
| Max length | 13 |
|---|---|
| Median length | 10 |
| Mean length | 8.756543188 |
| Min length | 5 |
Characters and Unicode
| Total characters | 401146 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Row Inside |
|---|---|
| 2nd row | Row Inside |
| 3rd row | Row Inside |
| 4th row | Row Inside |
| 5th row | Row Inside |
Common Values
| Value | Count | Frequency (%) |
| Row Inside | 19029 | |
| Single | 12809 | |
| Semi-Detached | 6496 | 14.2% |
| Row End | 5525 | 12.1% |
| Multi | 1732 | 3.8% |
| Town Inside | 154 | 0.3% |
| Town End | 63 | 0.1% |
| Default | 3 | < 0.1% |
Length
Pie chart
| Value | Count | Frequency (%) |
| row | 24554 | |
| inside | 19183 | |
| single | 12809 | |
| semi-detached | 6496 | 9.2% |
| end | 5588 | 7.9% |
| multi | 1732 | 2.5% |
| town | 217 | 0.3% |
| default | 3 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 51483 | |
| i | 40220 | 10.0% |
| n | 37797 | 9.4% |
| d | 31267 | 7.8% |
| o | 24771 | 6.2% |
| w | 24771 | 6.2% |
| (space) | 24771 | 6.2% |
| R | 24554 | 6.1% |
| S | 19305 | 4.8% |
| I | 19183 | 4.8% |
| Other values (15) | 103024 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 292801 | |
| Uppercase Letter | 77078 | 19.2% |
| Space Separator | 24771 | 6.2% |
| Dash Punctuation | 6496 | 1.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 51483 | |
| i | 40220 | |
| n | 37797 | |
| d | 31267 | |
| o | 24771 | |
| w | 24771 | |
| s | 19183 | 6.6% |
| l | 14544 | 5.0% |
| g | 12809 | 4.4% |
| t | 8231 | 2.8% |
| Other values (6) | 27725 |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 24554 | |
| S | 19305 | |
| I | 19183 | |
| D | 6499 | 8.4% |
| E | 5588 | 7.2% |
| M | 1732 | 2.2% |
| T | 217 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 24771 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 6496 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 369879 | |
| Common | 31267 | 7.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 51483 | |
| i | 40220 | |
| n | 37797 | |
| d | 31267 | |
| o | 24771 | 6.7% |
| w | 24771 | 6.7% |
| R | 24554 | 6.6% |
| S | 19305 | 5.2% |
| I | 19183 | 5.2% |
| s | 19183 | 5.2% |
| Other values (13) | 77345 |
Common
| Value | Count | Frequency (%) |
| (space) | 24771 | |
| - | 6496 | 20.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 401146 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 51483 | |
| i | 40220 | 10.0% |
| n | 37797 | 9.4% |
| d | 31267 | 7.8% |
| o | 24771 | 6.2% |
| w | 24771 | 6.2% |
| (space) | 24771 | 6.2% |
| R | 24554 | 6.1% |
| S | 19305 | 4.8% |
| I | 19183 | 4.8% |
| Other values (15) | 103024 |
GRADE
Categorical
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 358.0 KiB |
| Average | |
|---|---|
| Above Average | |
| Good Quality | |
| Very Good | |
| Excellent | |
| Other values (7) |
Length
| Max length | 13 |
|---|---|
| Median length | 12 |
| Mean length | 10.23112353 |
| Min length | 7 |
Characters and Unicode
| Total characters | 468698 |
|---|---|
| Distinct characters | 31 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Very Good |
|---|---|
| 2nd row | Very Good |
| 3rd row | Very Good |
| 4th row | Very Good |
| 5th row | Very Good |
Common Values
| Value | Count | Frequency (%) |
| Average | 14432 | |
| Above Average | 13627 | |
| Good Quality | 9854 | |
| Very Good | 4363 | 9.5% |
| Excellent | 1574 | 3.4% |
| Superior | 1321 | 2.9% |
| Exceptional-A | 395 | 0.9% |
| Exceptional-B | 145 | 0.3% |
| Fair Quality | 42 | 0.1% |
| Exceptional-C | 32 | 0.1% |
| Other values (2) | 26 | 0.1% |
Length
| Value | Count | Frequency (%) |
| average | 28059 | |
| good | 14217 | |
| above | 13627 | |
| quality | 9898 | 13.4% |
| very | 4363 | 5.9% |
| excellent | 1574 | 2.1% |
| superior | 1321 | 1.8% |
| exceptional-a | 395 | 0.5% |
| exceptional-b | 145 | 0.2% |
| fair | 42 | 0.1% |
| Other values (3) | 58 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 79173 | |
| o | 43980 | |
| A | 42081 | |
| v | 41686 | |
| a | 38595 | 8.2% |
| r | 35106 | 7.5% |
| g | 28059 | 6.0% |
| (space) | 27888 | 6.0% |
| y | 14261 | 3.0% |
| G | 14217 | 3.0% |
| Other values (21) | 103652 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 365919 | |
| Uppercase Letter | 74295 | 15.9% |
| Space Separator | 27888 | 6.0% |
| Dash Punctuation | 596 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 79173 | |
| o | 43980 | |
| v | 41686 | |
| a | 38595 | |
| r | 35106 | |
| g | 28059 | 7.7% |
| y | 14261 | 3.9% |
| d | 14217 | 3.9% |
| l | 13642 | 3.7% |
| b | 13627 | 3.7% |
| Other values (8) | 43573 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 42081 | |
| G | 14217 | 19.1% |
| Q | 9898 | 13.3% |
| V | 4363 | 5.9% |
| E | 2170 | 2.9% |
| S | 1321 | 1.8% |
| B | 145 | 0.2% |
| F | 42 | 0.1% |
| C | 32 | < 0.1% |
| D | 24 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 27888 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 596 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 440214 | |
| Common | 28484 | 6.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 79173 | |
| o | 43980 | |
| A | 42081 | |
| v | 41686 | |
| a | 38595 | |
| r | 35106 | 8.0% |
| g | 28059 | 6.4% |
| y | 14261 | 3.2% |
| G | 14217 | 3.2% |
| d | 14217 | 3.2% |
| Other values (19) | 88839 |
Common
| Value | Count | Frequency (%) |
| (space) | 27888 | |
| - | 596 | 2.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 468698 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 79173 | |
| o | 43980 | |
| A | 42081 | |
| v | 41686 | |
| a | 38595 | 8.2% |
| r | 35106 | 7.5% |
| g | 28059 | 6.0% |
| (space) | 27888 | 6.0% |
| y | 14261 | 3.0% |
| G | 14217 | 3.0% |
| Other values (21) | 103652 |
CNDTN
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 358.0 KiB |
| Good | |
|---|---|
| Average | |
| Very Good | |
| Excellent | 772 |
| Fair | 227 |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 5.84962127 |
| Min length | 4 |
Characters and Unicode
| Total characters | 267977 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Good |
|---|---|
| 2nd row | Very Good |
| 3rd row | Good |
| 4th row | Good |
| 5th row | Average |
Common Values
| Value | Count | Frequency (%) |
| Good | 21897 | |
| Average | 16771 | |
| Very Good | 6112 | 13.3% |
| Excellent | 772 | 1.7% |
| Fair | 227 | 0.5% |
| Poor | 32 | 0.1% |
Length
Pie chart
| Value | Count | Frequency (%) |
| good | 28009 | |
| average | 16771 | |
| very | 6112 | 11.8% |
| excellent | 772 | 1.5% |
| fair | 227 | 0.4% |
| poor | 32 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 56082 | |
| e | 41198 | |
| G | 28009 | |
| d | 28009 | |
| r | 23142 | |
| a | 16998 | 6.3% |
| A | 16771 | 6.3% |
| v | 16771 | 6.3% |
| g | 16771 | 6.3% |
| V | 6112 | 2.3% |
| Other values (11) | 18114 | 6.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 209942 | |
| Uppercase Letter | 51923 | 19.4% |
| Space Separator | 6112 | 2.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 56082 | |
| e | 41198 | |
| d | 28009 | |
| r | 23142 | |
| a | 16998 | 8.1% |
| v | 16771 | 8.0% |
| g | 16771 | 8.0% |
| y | 6112 | 2.9% |
| l | 1544 | 0.7% |
| x | 772 | 0.4% |
| Other values (4) | 2543 | 1.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 28009 | |
| A | 16771 | |
| V | 6112 | 11.8% |
| E | 772 | 1.5% |
| F | 227 | 0.4% |
| P | 32 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 6112 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 261865 | |
| Common | 6112 | 2.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 56082 | |
| e | 41198 | |
| G | 28009 | |
| d | 28009 | |
| r | 23142 | |
| a | 16998 | 6.5% |
| A | 16771 | 6.4% |
| v | 16771 | 6.4% |
| g | 16771 | 6.4% |
| V | 6112 | 2.3% |
| Other values (10) | 12002 | 4.6% |
Common
| Value | Count | Frequency (%) |
| (space) | 6112 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 267977 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 56082 | |
| e | 41198 | |
| G | 28009 | |
| d | 28009 | |
| r | 23142 | |
| a | 16998 | 6.3% |
| A | 16771 | 6.3% |
| v | 16771 | 6.3% |
| g | 16771 | 6.3% |
| V | 6112 | 2.3% |
| Other values (11) | 18114 | 6.8% |
EXTWALL
Categorical
| Distinct | 23 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 358.0 KiB |
| Common Brick | 34433 |
|---|---|
| Brick/Siding | 2915 |
| Vinyl Siding | 2317 |
| Wood Siding | 1767 |
| Stucco | 1292 |
| Other values (18) | 3087 |
Length
| Max length | 14 |
|---|---|
| Median length | 12 |
| Mean length | 11.648687 |
| Min length | 5 |
Characters and Unicode
| Total characters | 533638 |
|---|---|
| Distinct characters | 32 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Common Brick |
|---|---|
| 2nd row | Common Brick |
| 3rd row | Common Brick |
| 4th row | Common Brick |
| 5th row | Common Brick |
Common Values
| Value | Count | Frequency (%) |
| Common Brick | 34433 | 75.2% |
| Brick/Siding | 2915 | 6.4% |
| Vinyl Siding | 2317 | 5.1% |
| Wood Siding | 1767 | 3.9% |
| Stucco | 1292 | 2.8% |
| Brick Veneer | 483 | 1.1% |
| Shingle | 378 | 0.8% |
| Face Brick | 369 | 0.8% |
| Brick/Stucco | 343 | 0.7% |
| Aluminum | 324 | 0.7% |
| Other values (13) | 1190 | 2.6% |
Length
| Value | Count | Frequency (%) |
| brick | 35285 | |
| common | 34433 | |
| siding | 4103 | 4.8% |
| brick/siding | 2915 | 3.4% |
| vinyl | 2317 | 2.7% |
| wood | 1767 | 2.1% |
| stucco | 1312 | 1.5% |
| veneer | 603 | 0.7% |
| stone | 416 | 0.5% |
| shingle | 378 | 0.4% |
| Other values (13) | 1837 | 2.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 75376 | |
| m | 69514 | |
| i | 56263 | |
| n | 46339 | |
| c | 42897 | |
| r | 39622 | |
| (space) | 39555 | |
| B | 38905 | |
| k | 38905 | |
| C | 34490 | |
| Other values (22) | 51772 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 400965 | |
| Uppercase Letter | 89242 | 16.7% |
| Space Separator | 39555 | 7.4% |
| Other Punctuation | 3876 | 0.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 75376 | |
| m | 69514 | |
| i | 56263 | |
| n | 46339 | |
| c | 42897 | |
| r | 39622 | |
| k | 38905 | |
| d | 9066 | 2.3% |
| g | 7571 | 1.9% |
| e | 3730 | 0.9% |
| Other values (9) | 11682 | 2.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 38905 | |
| C | 34490 | |
| S | 10388 | 11.6% |
| V | 2920 | 3.3% |
| W | 1767 | 2.0% |
| F | 369 | 0.4% |
| A | 325 | 0.4% |
| H | 52 | 0.1% |
| M | 19 | < 0.1% |
| D | 6 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 39555 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 3876 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 490207 | |
| Common | 43431 | 8.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 75376 | |
| m | 69514 | |
| i | 56263 | |
| n | 46339 | |
| c | 42897 | |
| r | 39622 | |
| B | 38905 | |
| k | 38905 | |
| C | 34490 | |
| S | 10388 | 2.1% |
| Other values (20) | 37508 |
Common
| Value | Count | Frequency (%) |
| (space) | 39555 | |
| / | 3876 | 8.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 533638 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 75376 | |
| m | 69514 | |
| i | 56263 | |
| n | 46339 | |
| c | 42897 | |
| r | 39622 | |
| (space) | 39555 | |
| B | 38905 | |
| k | 38905 | |
| C | 34490 | |
| Other values (22) | 51772 |
ROOF
Categorical
| Distinct | 16 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 358.0 KiB |
| Built Up | 13621 |
|---|---|
| Metal- Sms | 13013 |
| Comp Shingle | 12706 |
| Slate | 4673 |
| Neopren | 795 |
| Other values (11) | 1003 |
Length
| Max length | 14 |
|---|---|
| Median length | 10 |
| Mean length | 9.347231014 |
| Min length | 5 |
Characters and Unicode
| Total characters | 428206 |
|---|---|
| Distinct characters | 32 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Metal- Sms |
|---|---|
| 2nd row | Built Up |
| 3rd row | Built Up |
| 4th row | Built Up |
| 5th row | Metal- Sms |
Common Values
| Value | Count | Frequency (%) |
| Built Up | 13621 | 29.7% |
| Metal- Sms | 13013 | 28.4% |
| Comp Shingle | 12706 | 27.7% |
| Slate | 4673 | 10.2% |
| Neopren | 795 | 1.7% |
| Shake | 303 | 0.7% |
| Clay Tile | 247 | 0.5% |
| Shingle | 194 | 0.4% |
| Metal- Pre | 104 | 0.2% |
| Typical | 70 | 0.2% |
| Other values (6) | 85 | 0.2% |
Length
| Value | Count | Frequency (%) |
| built | 13621 | |
| up | 13621 | |
| metal | 13136 | |
| sms | 13013 | |
| shingle | 12900 | |
| comp | 12706 | |
| slate | 4673 | 5.5% |
| neopren | 795 | 0.9% |
| shake | 303 | 0.4% |
| tile | 251 | 0.3% |
| Other values (11) | 567 | 0.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 44898 | 10.5% |
| (space) | 39775 | 9.3% |
| e | 32969 | 7.7% |
| t | 31493 | 7.4% |
| S | 30892 | 7.2% |
| p | 27267 | 6.4% |
| i | 26954 | 6.3% |
| m | 25775 | 6.0% |
| a | 18431 | 4.3% |
| n | 13756 | 3.2% |
| Other values (22) | 135996 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 289703 | |
| Uppercase Letter | 85589 | 20.0% |
| Space Separator | 39775 | 9.3% |
| Dash Punctuation | 13139 | 3.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 44898 | |
| e | 32969 | |
| t | 31493 | |
| p | 27267 | |
| i | 26954 | |
| m | 25775 | |
| a | 18431 | |
| n | 13756 | 4.7% |
| o | 13740 | 4.7% |
| u | 13621 | 4.7% |
| Other values (9) | 40799 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 30892 | |
| B | 13621 | |
| U | 13621 | |
| M | 13136 | |
| C | 13033 | |
| N | 795 | 0.9% |
| T | 321 | 0.4% |
| P | 106 | 0.1% |
| R | 56 | 0.1% |
| W | 5 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 13139 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 39775 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 375292 | |
| Common | 52914 | 12.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 44898 | |
| e | 32969 | 8.8% |
| t | 31493 | 8.4% |
| S | 30892 | 8.2% |
| p | 27267 | 7.3% |
| i | 26954 | 7.2% |
| m | 25775 | 6.9% |
| a | 18431 | 4.9% |
| n | 13756 | 3.7% |
| o | 13740 | 3.7% |
| Other values (20) | 109117 |
Common
| Value | Count | Frequency (%) |
| (space) | 39775 | |
| - | 13139 | 24.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 428206 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 44898 | 10.5% |
| (space) | 39775 | 9.3% |
| e | 32969 | 7.7% |
| t | 31493 | 7.4% |
| S | 30892 | 7.2% |
| p | 27267 | 6.4% |
| i | 26954 | 6.3% |
| m | 25775 | 6.0% |
| a | 18431 | 4.3% |
| n | 13756 | 3.2% |
| Other values (22) | 135996 |
INTWALL
Categorical
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 358.0 KiB |
| Hardwood | 35309 |
|---|---|
| Hardwood/Carp | 6031 |
| Wood Floor | 2936 |
| Carpet | 1404 |
| Lt Concrete | 37 |
| Other values (7) | 94 |
Length
| Max length | 13 |
|---|---|
| Median length | 8 |
| Mean length | 8.730588723 |
| Min length | 6 |
Characters and Unicode
| Total characters | 399957 |
|---|---|
| Distinct characters | 33 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Hardwood |
|---|---|
| 2nd row | Hardwood |
| 3rd row | Hardwood |
| 4th row | Hardwood |
| 5th row | Hardwood |
Common Values
| Value | Count | Frequency (%) |
| Hardwood | 35309 | 77.1% |
| Hardwood/Carp | 6031 | 13.2% |
| Wood Floor | 2936 | 6.4% |
| Carpet | 1404 | 3.1% |
| Lt Concrete | 37 | 0.1% |
| Ceramic Tile | 36 | 0.1% |
| Default | 28 | 0.1% |
| Parquet | 9 | < 0.1% |
| Vinyl Comp | 7 | < 0.1% |
| Resiliant | 6 | < 0.1% |
| Other values (2) | 8 | < 0.1% |
Length
| Value | Count | Frequency (%) |
| hardwood | 35309 | |
| hardwood/carp | 6031 | 12.4% |
| floor | 2936 | 6.0% |
| wood | 2936 | 6.0% |
| carpet | 1404 | 2.9% |
| lt | 37 | 0.1% |
| concrete | 37 | 0.1% |
| tile | 36 | 0.1% |
| ceramic | 36 | 0.1% |
| default | 28 | 0.1% |
| Other values (6) | 42 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 94471 | |
| d | 85616 | |
| r | 51799 | |
| a | 48857 | |
| H | 41340 | |
| w | 41340 | |
| C | 7515 | 1.9% |
| p | 7442 | 1.9% |
| / | 6031 | 1.5% |
| (space) | 3021 | 0.8% |
| Other values (23) | 12525 | 3.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 336042 | |
| Uppercase Letter | 54863 | 13.7% |
| Other Punctuation | 6031 | 1.5% |
| Space Separator | 3021 | 0.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 94471 | |
| d | 85616 | |
| r | 51799 | |
| a | 48857 | |
| w | 41340 | |
| p | 7442 | 2.2% |
| l | 3018 | 0.9% |
| e | 1606 | 0.5% |
| t | 1526 | 0.5% |
| i | 96 | < 0.1% |
| Other values (10) | 271 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 41340 | |
| C | 7515 | 13.7% |
| W | 2936 | 5.4% |
| F | 2936 | 5.4% |
| T | 39 | 0.1% |
| L | 37 | 0.1% |
| D | 28 | 0.1% |
| V | 12 | < 0.1% |
| P | 9 | < 0.1% |
| R | 6 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 3021 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 6031 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 390905 | |
| Common | 9052 | 2.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 94471 | |
| d | 85616 | |
| r | 51799 | |
| a | 48857 | |
| H | 41340 | |
| w | 41340 | |
| C | 7515 | 1.9% |
| p | 7442 | 1.9% |
| l | 3018 | 0.8% |
| W | 2936 | 0.8% |
| Other values (21) | 6571 | 1.7% |
Common
| Value | Count | Frequency (%) |
| / | 6031 | |
| (space) | 3021 | |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 399957 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 94471 | |
| d | 85616 | |
| r | 51799 | |
| a | 48857 | |
| H | 41340 | |
| w | 41340 | |
| C | 7515 | 1.9% |
| p | 7442 | 1.9% |
| / | 6031 | 1.5% |
| (space) | 3021 | 0.8% |
| Other values (23) | 12525 | 3.1% |
KITCHENS
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.227936522 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 24 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 358.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.5899153747 |
|---|---|
| Coefficient of variation (CV) | 0.4804119466 |
| Kurtosis | 10.77401125 |
| Mean | 1.227936522 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.14934249 |
| Sum | 56253 |
| Variance | 0.3480001493 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 38066 | |
| 2 | 6134 | 13.4% |
| 4 | 1141 | 2.5% |
| 3 | 439 | 1.0% |
| 0 | 24 | 0.1% |
| 5 | 4 | < 0.1% |
| 6 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 24 | 0.1% |
| 1 | 38066 | |
| 2 | 6134 | 13.4% |
| 3 | 439 | 1.0% |
| 4 | 1141 | 2.5% |
| 5 | 4 | < 0.1% |
| 6 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 6 | 3 | < 0.1% |
| 5 | 4 | < 0.1% |
| 4 | 1141 | 2.5% |
| 3 | 439 | 1.0% |
| 2 | 6134 | 13.4% |
| 1 | 38066 | |
| 0 | 24 | 0.1% |
FIREPLACES
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.6825871516 |
| Minimum | 0 |
|---|---|
| Maximum | 13 |
| Zeros | 24362 |
| Zeros (%) | 53.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 358.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 13 |
| Range | 13 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.9257232524 |
|---|---|
| Coefficient of variation (CV) | 1.356197887 |
| Kurtosis | 8.175808408 |
| Mean | 0.6825871516 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.048501274 |
| Sum | 31270 |
| Variance | 0.8569635401 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 24362 | |
| 1 | 14554 | |
| 2 | 5073 | 11.1% |
| 3 | 1162 | 2.5% |
| 4 | 405 | 0.9% |
| 5 | 145 | 0.3% |
| 6 | 73 | 0.2% |
| 7 | 22 | < 0.1% |
| 8 | 5 | < 0.1% |
| 9 | 3 | < 0.1% |
| Other values (4) | 7 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 24362 | |
| 1 | 14554 | |
| 2 | 5073 | 11.1% |
| 3 | 1162 | 2.5% |
| 4 | 405 | 0.9% |
| 5 | 145 | 0.3% |
| 6 | 73 | 0.2% |
| 7 | 22 | < 0.1% |
| 8 | 5 | < 0.1% |
| 9 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 13 | 2 | < 0.1% |
| 12 | 1 | < 0.1% |
| 11 | 2 | < 0.1% |
| 10 | 2 | < 0.1% |
| 9 | 3 | < 0.1% |
| 8 | 5 | < 0.1% |
| 7 | 22 | < 0.1% |
| 6 | 73 | 0.2% |
| 5 | 145 | 0.3% |
| 4 | 405 |
LANDAREA
| Distinct | 7834 |
|---|---|
| Distinct (%) | 17.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3154.642182 |
| Minimum | 0 |
|---|---|
| Maximum | 155905 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 358.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 895 |
| Q1 | 1501 |
| median | 2178 |
| Q3 | 4000 |
| 95-th percentile | 7985 |
| Maximum | 155905 |
| Range | 155905 |
| Interquartile range (IQR) | 2499 |
Descriptive statistics
| Standard deviation | 2975.201257 |
|---|---|
| Coefficient of variation (CV) | 0.9431184537 |
| Kurtosis | 216.2697942 |
| Mean | 3154.642182 |
| Median Absolute Deviation (MAD) | 880 |
| Skewness | 7.753024619 |
| Sum | 144517313 |
| Variance | 8851822.518 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1800 | 473 | 1.0% |
| 2000 | 459 | 1.0% |
| 4000 | 347 | 0.8% |
| 1600 | 339 | 0.7% |
| 5000 | 331 | 0.7% |
| 1700 | 259 | 0.6% |
| 1440 | 257 | 0.6% |
| 1500 | 244 | 0.5% |
| 1620 | 215 | 0.5% |
| 2500 | 214 | 0.5% |
| Other values (7824) | 42673 |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 216 | 1 | < 0.1% |
| 255 | 1 | < 0.1% |
| 288 | 1 | < 0.1% |
| 294 | 1 | < 0.1% |
| 327 | 3 | |
| 331 | 1 | < 0.1% |
| 350 | 2 | |
| 353 | 1 | < 0.1% |
| 357 | 2 |
| Value | Count | Frequency (%) |
| 155905 | 1 | |
| 95370 | 1 | |
| 73771 | 1 | |
| 67727 | 1 | |
| 64205 | 1 | |
| 61420 | 1 | |
| 57831 | 1 | |
| 54335 | 1 | |
| 53034 | 1 | |
| 46304 | 1 |
ZIPCODE
| Distinct | 21 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20011.47587 |
| Minimum | 20001 |
|---|---|
| Maximum | 20052 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 358.0 KiB |
Quantile statistics
| Minimum | 20001 |
|---|---|
| 5-th percentile | 20001 |
| Q1 | 20003 |
| median | 20011 |
| Q3 | 20017 |
| 95-th percentile | 20020 |
| Maximum | 20052 |
| Range | 51 |
| Interquartile range (IQR) | 14 |
Descriptive statistics
| Standard deviation | 7.600664145 |
|---|---|
| Coefficient of variation (CV) | 0.0003798152717 |
| Kurtosis | 0.2993417555 |
| Mean | 20011.47587 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 0.5642634287 |
| Sum | 916745721 |
| Variance | 57.77009545 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 20002 | 6241 | |
| 20011 | 5934 | |
| 20019 | 4431 | |
| 20003 | 3242 | 7.1% |
| 20016 | 3176 | 6.9% |
| 20007 | 3116 | 6.8% |
| 20020 | 2679 | 5.8% |
| 20001 | 2438 | 5.3% |
| 20015 | 2292 | 5.0% |
| 20018 | 2070 | 4.5% |
| Other values (11) | 10192 |
| Value | Count | Frequency (%) |
| 20001 | 2438 | 5.3% |
| 20002 | 6241 | |
| 20003 | 3242 | |
| 20005 | 104 | 0.2% |
| 20007 | 3116 | |
| 20008 | 1568 | 3.4% |
| 20009 | 1599 | 3.5% |
| 20010 | 1800 | 3.9% |
| 20011 | 5934 | |
| 20012 | 1372 | 3.0% |
| Value | Count | Frequency (%) |
| 20052 | 6 | < 0.1% |
| 20037 | 166 | 0.4% |
| 20036 | 67 | 0.1% |
| 20032 | 1427 | 3.1% |
| 20024 | 272 | 0.6% |
| 20020 | 2679 | |
| 20019 | 4431 | |
| 20018 | 2070 | |
| 20017 | 1811 | |
| 20016 | 3176 |
LATITUDE
| Distinct | 45386 |
|---|---|
| Distinct (%) | 99.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 38.91697554 |
| Minimum | 38.81995335 |
|---|---|
| Maximum | 38.9954352 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 358.0 KiB |
Quantile statistics
| Minimum | 38.81995335 |
|---|---|
| 5-th percentile | 38.85881964 |
| Q1 | 38.89372785 |
| median | 38.91720139 |
| Q3 | 38.94337687 |
| 95-th percentile | 38.96777483 |
| Maximum | 38.9954352 |
| Range | 0.17548185 |
| Interquartile range (IQR) | 0.049649025 |
Descriptive statistics
| Standard deviation | 0.03338690058 |
|---|---|
| Coefficient of variation (CV) | 0.0008579007006 |
| Kurtosis | -0.2883711537 |
| Mean | 38.91697554 |
| Median Absolute Deviation (MAD) | 0.02465457 |
| Skewness | -0.2789265642 |
| Sum | 1782825.567 |
| Variance | 0.001114685131 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 38.89969609 | 40 | 0.1% |
| 38.96050651 | 10 | < 0.1% |
| 38.90274731 | 9 | < 0.1% |
| 38.92702875 | 9 | < 0.1% |
| 38.92688399 | 8 | < 0.1% |
| 38.92650681 | 7 | < 0.1% |
| 38.92335415 | 5 | < 0.1% |
| 38.92669851 | 5 | < 0.1% |
| 38.91352966 | 5 | < 0.1% |
| 38.89629849 | 4 | < 0.1% |
| Other values (45376) | 45709 |
| Value | Count | Frequency (%) |
| 38.81995335 | 1 | |
| 38.82006029 | 1 | |
| 38.82021867 | 1 | |
| 38.820272 | 1 | |
| 38.82038362 | 1 | |
| 38.82075548 | 1 | |
| 38.82076875 | 1 | |
| 38.82077366 | 1 | |
| 38.82082349 | 1 | |
| 38.821041 | 1 |
| Value | Count | Frequency (%) |
| 38.9954352 | 1 | |
| 38.99489423 | 1 | |
| 38.99479729 | 1 | |
| 38.99475116 | 1 | |
| 38.99470976 | 1 | |
| 38.99457514 | 1 | |
| 38.99441351 | 1 | |
| 38.99420558 | 1 | |
| 38.99414366 | 1 | |
| 38.99412842 | 1 |
LONGITUDE
| Distinct | 45517 |
|---|---|
| Distinct (%) | 99.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -77.01259515 |
| Minimum | -77.11390873 |
|---|---|
| Maximum | -76.9097583 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 45811 |
| Negative (%) | 100.0% |
| Memory size | 358.0 KiB |
Quantile statistics
| Minimum | -77.11390873 |
|---|---|
| 5-th percentile | -77.08743656 |
| Q1 | -77.04017364 |
| median | -77.00959268 |
| Q3 | -76.98480747 |
| 95-th percentile | -76.93557863 |
| Maximum | -76.9097583 |
| Range | 0.20415043 |
| Interquartile range (IQR) | 0.055366175 |
Descriptive statistics
| Standard deviation | 0.04359272504 |
|---|---|
| Coefficient of variation (CV) | -0.000566046696 |
| Kurtosis | -0.5107156491 |
| Mean | -77.01259515 |
| Median Absolute Deviation (MAD) | 0.0264139 |
| Skewness | -0.08265261743 |
| Sum | -3528023.997 |
| Variance | 0.001900325676 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -76.95150851 | 41 | 0.1% |
| -77.07366473 | 10 | < 0.1% |
| -76.950931 | 8 | < 0.1% |
| -76.95398472 | 5 | < 0.1% |
| -77.00079769 | 5 | < 0.1% |
| -77.00079383 | 4 | < 0.1% |
| -77.06689354 | 4 | < 0.1% |
| -76.99929156 | 3 | < 0.1% |
| -76.99934195 | 3 | < 0.1% |
| -76.99639967 | 3 | < 0.1% |
| Other values (45507) | 45725 |
| Value | Count | Frequency (%) |
| -77.11390873 | 1 | |
| -77.1138097 | 1 | |
| -77.11377421 | 1 | |
| -77.113389 | 1 | |
| -77.11332066 | 1 | |
| -77.11314986 | 1 | |
| -77.11304878 | 1 | |
| -77.11270964 | 1 | |
| -77.11263572 | 1 | |
| -77.11252689 | 1 |
| Value | Count | Frequency (%) |
| -76.9097583 | 1 | |
| -76.90984266 | 1 | |
| -76.9099699 | 1 | |
| -76.90998346 | 1 | |
| -76.91012579 | 1 | |
| -76.9102437 | 1 | |
| -76.9102789 | 1 | |
| -76.9103339 | 1 | |
| -76.9104433 | 1 | |
| -76.91054585 | 1 |
ASSESSMENT_NBHD
Categorical
| Distinct | 55 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 358.0 KiB |
| Old City 1 | 6110 |
|---|---|
| Petworth | 2402 |
| Chevy Chase | 2323 |
| Old City 2 | 2269 |
| Deanwood | 2155 |
| Other values (50) | 30552 |
Length
| Max length | 28 |
|---|---|
| Median length | 10 |
| Mean length | 11.23313178 |
| Min length | 4 |
Characters and Unicode
| Total characters | 514601 |
|---|---|
| Distinct characters | 49 |
| Distinct categories | 6 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Old City 2 |
|---|---|
| 2nd row | Old City 2 |
| 3rd row | Old City 2 |
| 4th row | Old City 2 |
| 5th row | Old City 2 |
Common Values
| Value | Count | Frequency (%) |
| Old City 1 | 6110 | 13.3% |
| Petworth | 2402 | 5.2% |
| Chevy Chase | 2323 | 5.1% |
| Old City 2 | 2269 | 5.0% |
| Deanwood | 2155 | 4.7% |
| Columbia Heights | 2123 | 4.6% |
| Brookland | 2052 | 4.5% |
| Capitol Hill | 1483 | 3.2% |
| Brightwood | 1437 | 3.1% |
| Congress Heights | 1324 | 2.9% |
| Other values (45) | 22133 |
Length
| Value | Count | Frequency (%) |
| old | 8379 | 9.9% |
| city | 8379 | 9.9% |
| heights | 6793 | 8.0% |
| 1 | 6198 | 7.3% |
| park | 4749 | 5.6% |
| petworth | 2402 | 2.8% |
| chase | 2323 | 2.7% |
| chevy | 2323 | 2.7% |
| 2 | 2269 | 2.7% |
| deanwood | 2155 | 2.5% |
| Other values (64) | 39068 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 39227 | 7.6% |
| t | 38525 | 7.5% |
| e | 37125 | 7.2% |
| i | 36941 | 7.2% |
| o | 33862 | 6.6% |
| l | 31228 | 6.1% |
| a | 28703 | 5.6% |
| r | 27663 | 5.4% |
| d | 23379 | 4.5% |
| s | 20374 | 4.0% |
| Other values (39) | 197574 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 388673 | |
| Uppercase Letter | 75781 | 14.7% |
| Space Separator | 39227 | 7.6% |
| Decimal Number | 10047 | 2.0% |
| Other Punctuation | 785 | 0.2% |
| Dash Punctuation | 88 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 38525 | |
| e | 37125 | |
| i | 36941 | |
| o | 33862 | 8.7% |
| l | 31228 | 8.0% |
| a | 28703 | 7.4% |
| r | 27663 | 7.1% |
| d | 23379 | 6.0% |
| s | 20374 | 5.2% |
| n | 20057 | 5.2% |
| Other values (13) | 90816 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 19914 | |
| H | 9532 | |
| P | 9004 | |
| O | 8582 | |
| B | 4812 | 6.3% |
| D | 3269 | 4.3% |
| F | 2417 | 3.2% |
| R | 2138 | 2.8% |
| G | 1974 | 2.6% |
| M | 1845 | 2.4% |
| Other values (10) | 12294 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 6988 | |
| 2 | 2269 | 22.6% |
| 6 | 790 | 7.9% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 39227 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 88 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 785 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 464454 | |
| Common | 50147 | 9.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 38525 | 8.3% |
| e | 37125 | 8.0% |
| i | 36941 | 8.0% |
| o | 33862 | 7.3% |
| l | 31228 | 6.7% |
| a | 28703 | 6.2% |
| r | 27663 | 6.0% |
| d | 23379 | 5.0% |
| s | 20374 | 4.4% |
| n | 20057 | 4.3% |
| Other values (33) | 166597 |
Common
| Value | Count | Frequency (%) |
| (space) | 39227 | |
| 1 | 6988 | 13.9% |
| 2 | 2269 | 4.5% |
| 6 | 790 | 1.6% |
| . | 785 | 1.6% |
| - | 88 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 514601 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 39227 | 7.6% |
| t | 38525 | 7.5% |
| e | 37125 | 7.2% |
| i | 36941 | 7.2% |
| o | 33862 | 6.6% |
| l | 31228 | 6.1% |
| a | 28703 | 5.6% |
| r | 27663 | 5.4% |
| d | 23379 | 4.5% |
| s | 20374 | 4.0% |
| Other values (39) | 197574 |
WARD
Categorical
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 358.0 KiB |
| Ward 4 | 8331 |
|---|---|
| Ward 6 | 7945 |
| Ward 5 | 7338 |
| Ward 3 | 6925 |
| Ward 7 | 5616 |
| Other values (3) | 9656 |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Characters and Unicode
| Total characters | 274866 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Ward 2 |
|---|---|
| 2nd row | Ward 2 |
| 3rd row | Ward 2 |
| 4th row | Ward 2 |
| 5th row | Ward 2 |
Common Values
| Value | Count | Frequency (%) |
| Ward 4 | 8331 | 18.2% |
| Ward 6 | 7945 | 17.3% |
| Ward 5 | 7338 | 16.0% |
| Ward 3 | 6925 | 15.1% |
| Ward 7 | 5616 | 12.3% |
| Ward 1 | 3550 | 7.7% |
| Ward 8 | 3228 | 7.0% |
| Ward 2 | 2878 | 6.3% |
Length
| Value | Count | Frequency (%) |
| ward | 45811 | |
| 4 | 8331 | 9.1% |
| 6 | 7945 | 8.7% |
| 5 | 7338 | 8.0% |
| 3 | 6925 | 7.6% |
| 7 | 5616 | 6.1% |
| 1 | 3550 | 3.9% |
| 8 | 3228 | 3.5% |
| 2 | 2878 | 3.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| W | 45811 | |
| a | 45811 | |
| r | 45811 | |
| d | 45811 | |
| (space) | 45811 | |
| 4 | 8331 | 3.0% |
| 6 | 7945 | 2.9% |
| 5 | 7338 | 2.7% |
| 3 | 6925 | 2.5% |
| 7 | 5616 | 2.0% |
| Other values (3) | 9656 | 3.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 137433 | |
| Uppercase Letter | 45811 | 16.7% |
| Space Separator | 45811 | 16.7% |
| Decimal Number | 45811 | 16.7% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 8331 | |
| 6 | 7945 | |
| 5 | 7338 | |
| 3 | 6925 | |
| 7 | 5616 | |
| 1 | 3550 | |
| 8 | 3228 | 7.0% |
| 2 | 2878 | 6.3% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 45811 | |
| r | 45811 | |
| d | 45811 |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 45811 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 45811 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 183244 | |
| Common | 91622 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| (space) | 45811 | |
| 4 | 8331 | 9.1% |
| 6 | 7945 | 8.7% |
| 5 | 7338 | 8.0% |
| 3 | 6925 | 7.6% |
| 7 | 5616 | 6.1% |
| 1 | 3550 | 3.9% |
| 8 | 3228 | 3.5% |
| 2 | 2878 | 3.1% |
Latin
| Value | Count | Frequency (%) |
| W | 45811 | |
| a | 45811 | |
| r | 45811 | |
| d | 45811 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 274866 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| W | 45811 | |
| a | 45811 | |
| r | 45811 | |
| d | 45811 | |
| (space) | 45811 | |
| 4 | 8331 | 3.0% |
| 6 | 7945 | 2.9% |
| 5 | 7338 | 2.7% |
| 3 | 6925 | 2.5% |
| 7 | 5616 | 2.0% |
| Other values (3) | 9656 | 3.5% |
QUADRANT
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 358.0 KiB |
| NW | 22562 |
|---|---|
| NE | 13769 |
| SE | 8940 |
| SW | 540 |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 91622 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NW |
|---|---|
| 2nd row | NW |
| 3rd row | NW |
| 4th row | NW |
| 5th row | NW |
Common Values
| Value | Count | Frequency (%) |
| NW | 22562 | 49.3% |
| NE | 13769 | 30.1% |
| SE | 8940 | 19.5% |
| SW | 540 | 1.2% |
Length
| Value | Count | Frequency (%) |
| nw | 22562 | |
| ne | 13769 | |
| se | 8940 | 19.5% |
| sw | 540 | 1.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 36331 | |
| W | 23102 | |
| E | 22709 | |
| S | 9480 | 10.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 91622 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 36331 | |
| W | 23102 | |
| E | 22709 | |
| S | 9480 | 10.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 91622 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 36331 | |
| W | 23102 | |
| E | 22709 | |
| S | 9480 | 10.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 91622 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 36331 | |
| W | 23102 | |
| E | 22709 | |
| S | 9480 | 10.3% |
Pearson's r
Pearson's correlation coefficient (r) measures the linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and (positive) scale of the two variables, so for a linear relationship the steepness of the line does not affect r; only the sign of its slope does. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
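As a rough illustration of the calculation described above, the following pure-Python sketch divides the covariance by the product of the standard deviations. The function name `pearson_r` and the sample values are hypothetical and are not taken from the profiled dataset.

```python
import math


def pearson_r(x, y):
    """Pearson's r: covariance of x and y over the product of their standard deviations."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # The 1/n factors of the covariance and the standard deviations cancel,
    # so plain sums of deviations are enough.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    if ss_x == 0 or ss_y == 0:
        raise ValueError("r is undefined when a variable is constant")
    return cov / math.sqrt(ss_x * ss_y)


# Hypothetical sample values, chosen only for illustration.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 4.0, 5.0, 4.0, 5.0]

r = pearson_r(x, y)
# Shifting and (positively) rescaling x should leave r unchanged.
r_rescaled = pearson_r([10.0 * a + 3.0 for a in x], y)
print(round(r, 4), round(r_rescaled, 4))
```

The second call checks the invariance property numerically: a change of location and positive scale in one variable gives the same coefficient up to floating-point error.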
Spearman's ρ
Spearman's rank correlation coefficient (ρ) measures the monotonic correlation between two variables and is therefore better than Pearson's r at catching nonlinear monotonic correlations. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
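The rank-based calculation can be sketched in pure Python as follows. The helper names `average_ranks` and `spearman_rho` are hypothetical, and assigning tied values the average of their positions is an assumption of this sketch rather than something the report states.

```python
import math


def average_ranks(values):
    """Return 1-based ranks, giving tied values the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the run while the next sorted value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        shared_rank = (start + end) / 2 + 1
        for k in range(start, end + 1):
            ranks[order[k]] = shared_rank
        start = end + 1
    return ranks


def spearman_rho(x, y):
    """Spearman's rho: Pearson's r computed on the rank variables of x and y."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mean_rx, mean_ry = sum(rx) / n, sum(ry) / n
    cov = sum((a - mean_rx) * (b - mean_ry) for a, b in zip(rx, ry))
    ss_x = sum((a - mean_rx) ** 2 for a in rx)
    ss_y = sum((b - mean_ry) ** 2 for b in ry)
    return cov / math.sqrt(ss_x * ss_y)


# A nonlinear but strictly increasing relationship (hypothetical values).
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]
rho = spearman_rho(x, y)
print(rho)
```

Because only the ordering of the values enters the calculation, the cubic relationship in the example is treated the same way as a straight line would be.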
Kendall's τ
Like Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one counts the concordant and discordant pairs of observations; τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
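A minimal sketch of the pair-counting formula as stated above (often called tau-a) is shown below. The function name `kendall_tau_a` and the sample values are hypothetical. Tied pairs count as neither concordant nor discordant here; statistical libraries commonly report a tie-adjusted variant (tau-b), so their results can differ when ties are present.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    concordant = 0
    discordant = 0
    for i, j in combinations(range(len(x)), 2):
        # The sign of the product tells whether both variables move the same way.
        direction = (x[i] - x[j]) * (y[i] - y[j])
        if direction > 0:
            concordant += 1
        elif direction < 0:
            discordant += 1
    total_pairs = len(x) * (len(x) - 1) // 2
    return (concordant - discordant) / total_pairs


# Hypothetical values: one swapped pair out of six.
x = [1, 2, 3, 4]
y = [1, 3, 2, 4]
tau = kendall_tau_a(x, y)
print(tau)
```

The quadratic loop over all pairs keeps the sketch close to the definition; it is not meant for columns as long as the ones profiled in this report.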
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure proposed by Bergsma in 2013.
First rows
| | Unnamed: 0 | BATHRM | HF_BATHRM | HEAT | AC | YR_RMDL | EYB | PRICE | SALE_NUM | STYLE | STRUCT | GRADE | CNDTN | EXTWALL | ROOF | INTWALL | KITCHENS | FIREPLACES | LANDAREA | ZIPCODE | LATITUDE | LONGITUDE | ASSESSMENT_NBHD | WARD | QUADRANT |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 4 | 0 | Warm Cool | 1 | 1988.0 | 1972 | 1095000.0 | 1 | 3 Story | Row Inside | Very Good | Good | Common Brick | Metal- Sms | Hardwood | 2.0 | 5 | 1680 | 20009.0 | 38.914680 | -77.040832 | Old City 2 | Ward 2 | NW |
| 1 | 2 | 3 | 1 | Hot Water Rad | 1 | 2009.0 | 1984 | 2100000.0 | 3 | 3 Story | Row Inside | Very Good | Very Good | Common Brick | Built Up | Hardwood | 2.0 | 4 | 1680 | 20009.0 | 38.914684 | -77.040678 | Old City 2 | Ward 2 | NW |
| 2 | 3 | 3 | 1 | Hot Water Rad | 1 | 2003.0 | 1984 | 1602000.0 | 1 | 3 Story | Row Inside | Very Good | Good | Common Brick | Built Up | Hardwood | 2.0 | 3 | 1680 | 20009.0 | 38.914683 | -77.040629 | Old City 2 | Ward 2 | NW |
| 3 | 5 | 3 | 2 | Hot Water Rad | 1 | 0.0 | 1972 | 1950000.0 | 1 | 4 Story | Row Inside | Very Good | Good | Common Brick | Built Up | Hardwood | 1.0 | 4 | 2196 | 20009.0 | 38.914331 | -77.039715 | Old City 2 | Ward 2 | NW |
| 4 | 7 | 3 | 1 | Hot Water Rad | 1 | 2011.0 | 1972 | 1050000.0 | 1 | 3 Story | Row Inside | Very Good | Average | Common Brick | Metal- Sms | Hardwood | 2.0 | 1 | 1627 | 20009.0 | 38.915408 | -77.040129 | Old City 2 | Ward 2 | NW |
| 5 | 8 | 3 | 1 | Warm Cool | 1 | 2008.0 | 1967 | 1430000.0 | 4 | 2 Story | Row Inside | Above Average | Very Good | Common Brick | Built Up | Hardwood | 2.0 | 1 | 1424 | 20009.0 | 38.915017 | -77.039903 | Old City 2 | Ward 2 | NW |
| 6 | 14 | 3 | 1 | Warm Cool | 1 | 2000.0 | 1967 | 1325000.0 | 1 | 2 Story | Row Inside | Above Average | Very Good | Stucco | Metal- Sms | Hardwood | 2.0 | 1 | 1815 | 20009.0 | 38.915038 | -77.039716 | Old City 2 | Ward 2 | NW |
| 7 | 16 | 3 | 1 | Warm Cool | 1 | 2006.0 | 1967 | 1240000.0 | 1 | 2 Story | Row Inside | Above Average | Very Good | Common Brick | Metal- Sms | Hardwood | 1.0 | 0 | 1424 | 20009.0 | 38.915018 | -77.039844 | Old City 2 | Ward 2 | NW |
| 8 | 19 | 3 | 1 | Hot Water Rad | 1 | 2013.0 | 1969 | 592250.0 | 1 | 2 Story | Row Inside | Good Quality | Good | Common Brick | Built Up | Hardwood | 2.0 | 1 | 1424 | 20009.0 | 38.915019 | -77.040138 | Old City 2 | Ward 2 | NW |
| 9 | 22 | 1 | 0 | Forced Air | 1 | 2010.0 | 1967 | 907400.0 | 1 | 2 Story | Semi-Detached | Above Average | Good | Common Brick | Built Up | Hardwood | 1.0 | 0 | 2090 | 20009.0 | 38.911368 | -77.033925 | Old City 2 | Ward 2 | NW |
Last rows
| | Unnamed: 0 | BATHRM | HF_BATHRM | HEAT | AC | YR_RMDL | EYB | PRICE | SALE_NUM | STYLE | STRUCT | GRADE | CNDTN | EXTWALL | ROOF | INTWALL | KITCHENS | FIREPLACES | LANDAREA | ZIPCODE | LATITUDE | LONGITUDE | ASSESSMENT_NBHD | WARD | QUADRANT |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 45801 | 106656 | 2 | 1 | Hot Water Rad | 1 | 2007.0 | 1964 | 227000.0 | 6 | 2 Story | Row Inside | Average | Good | Common Brick | Built Up | Hardwood | 1.0 | 0 | 1733 | 20032.0 | 38.848216 | -76.997142 | Congress Heights | Ward 8 | SE |
| 45802 | 106657 | 2 | 1 | Hot Water Rad | 1 | 2008.0 | 1957 | 140496.0 | 1 | 2 Story | Row Inside | Average | Good | Common Brick | Metal- Sms | Hardwood | 1.0 | 1 | 1734 | 20032.0 | 38.848213 | -76.997192 | Congress Heights | Ward 8 | SE |
| 45803 | 106662 | 1 | 1 | Warm Cool | 1 | 0.0 | 1976 | 120000.0 | 1 | 3 Story | Row Inside | Average | Average | Wood Siding | Built Up | Wood Floor | 1.0 | 0 | 2799 | 20032.0 | 38.825131 | -76.997396 | Congress Heights | Ward 8 | SE |
| 45804 | 106663 | 1 | 1 | Warm Cool | 1 | 0.0 | 1976 | 109256.0 | 1 | 3 Story | Row Inside | Average | Average | Common Brick | Built Up | Wood Floor | 1.0 | 0 | 2579 | 20032.0 | 38.825080 | -76.997430 | Congress Heights | Ward 8 | SE |
| 45805 | 106664 | 1 | 1 | Warm Cool | 1 | 2013.0 | 1988 | 230000.0 | 1 | 3 Story | Row Inside | Average | Good | Common Brick | Built Up | Wood Floor | 1.0 | 0 | 2359 | 20032.0 | 38.825026 | -76.997449 | Congress Heights | Ward 8 | SE |
| 45806 | 106666 | 1 | 1 | Warm Cool | 1 | 2013.0 | 1988 | 215000.0 | 1 | 3 Story | Row Inside | Average | Good | Common Brick | Built Up | Wood Floor | 1.0 | 0 | 1919 | 20032.0 | 38.824922 | -76.997489 | Congress Heights | Ward 8 | SE |
| 45807 | 106668 | 2 | 1 | Forced Air | 1 | 0.0 | 1988 | 205000.0 | 1 | 3 Story | Row Inside | Average | Average | Common Brick | Built Up | Wood Floor | 1.0 | 0 | 2513 | 20032.0 | 38.824805 | -76.997597 | Congress Heights | Ward 8 | SE |
| 45808 | 106672 | 3 | 0 | Forced Air | 0 | 0.0 | 1963 | 100000.0 | 1 | 2 Story | Multi | Average | Average | Common Brick | Comp Shingle | Hardwood | 3.0 | 0 | 4374 | 20032.0 | 38.820755 | -77.007009 | Congress Heights | Ward 8 | SW |
| 45809 | 106673 | 3 | 0 | Forced Air | 0 | 2002.0 | 1988 | 103000.0 | 1 | 2 Story | Multi | Average | Good | Common Brick | Comp Shingle | Hardwood | 3.0 | 0 | 4523 | 20032.0 | 38.820823 | -77.007013 | Congress Heights | Ward 8 | SW |
| 45810 | 106687 | 2 | 0 | Forced Air | 0 | 0.0 | 1962 | 95000.0 | 1 | 2 Story | Multi | Average | Average | Common Brick | Comp Shingle | Carpet | 2.0 | 0 | 5837 | 20032.0 | 38.821855 | -77.005828 | Congress Heights | Ward 8 | SW |